A Method for Closed Frequent Subgraph Mining in a Single Large Graph
نویسندگان
چکیده
Mining frequent subgraphs is an interesting and important problem in the graph mining field, that from a single large has been strongly developed, recently attracted many researchers. Among them, MNI-based approaches are considered as state-of-the-art, such GraMi algorithm. Besides subgraph (FSM), closed was also developed. This practical applications fundamental premise for studies. paper proposes CloGraMi (Closed Frequent Subgraph Mining) algorithm based on to find all graph. Two effective strategies first one new level order traversal strategy quickly determine searching process, second setting condition early pruning portion of non-closed candidates, both them aim reduce running time well memory requirements, improve performance proposed Our experiments performed five real datasets (both directed undirected graphs) results show requirements our significantly better than those GraMi-based
منابع مشابه
A Parallel Approach for Frequent Subgraph Mining in a Single Large Graph Using Spark
Frequent subgraph mining (FSM) plays an important role in graph mining, attracting a great deal of attention in many areas, such as bioinformatics, web data mining and social networks. In this paper, we propose SSIGRAM (Spark based Single Graph Mining), a Spark based parallel frequent subgraph mining algorithm in a single large graph. Aiming to approach the two computational challenges of FSM, ...
متن کاملGRAMI: Frequent Subgraph and Pattern Mining in a Single Large Graph
Mining frequent subgraphs is an important operation on graphs; it is defined as finding all subgraphs that appear frequently in a database according to a given frequency threshold. Most existing work assumes a database of many small graphs, but modern applications, such as social networks, citation graphs, or proteinprotein interactions in bioinformatics, are modeled as a single large graph. In...
متن کاملSupport Computation for Mining Frequent Subgraphs in a Single Graph
Defining the support (or frequency) of a subgraph is trivial when a database of graphs is given: it is simply the number of graphs in the database that contain the subgraph. However, if the input is one large graph, it is surprisingly difficult to find an appropriate support definition. In this paper we study the core problem, namely overlapping embeddings of the subgraph, in detail and suggest...
متن کاملA hybrid method for frequent closed patterns mining in large-scale high dimensional biological data
Large high dimensional biological data have posed great challenges to most existing algorithms for frequent patterns mining. In this paper, we propose a hybrid method to find all frequent patterns: we firstly decompose the mining task into two subproblems (discovering long frequent patterns and short frequent patterns), then we choose different algorithms to solve the subproblems respectively. ...
متن کاملFrequent Subgraph Mining from Streams of Linked Graph Structured Data
Nowadays, high volumes of high-value data (e.g., semantic web data) can be generated and published at a high velocity. A collection of these data can be viewed as a big, interlinked, dynamic graph structure of linked resources. Embedded in them are implicit, previously unknown, and potentially useful knowledge. Hence, efficient knowledge discovery algorithms for mining frequent subgraphs from t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2021
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2021.3133666